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ABSTRACT 

Head Start is a federally funded preschool prograa 
for disadvantaged children. In this paper the problems of evaluating 
Head Start froB an economic viewpoint are considered and a reanalysis 
of the data collected for the 1969 Vestinghouse learning 
Corporation — Ohio Sniversity study is carried out. An evaluation of 
Head Start is difficult because of conceptual and statistical 
problems. The conceptual problems deal with the difficulty in 
translating the benefits of Head Start into dollar firr^s so that it 
can be determined if the benefits exceed the costs and if Head Start 
is the most efficient social action program. Statistical problems 
arise because it is rarely possible to measure all of the independent 
variables in the model and some of tbe independent variables may be 
measured vitL error; these specification errors can lead to biased 
estimates of program effects. Regression analysis is used in the 
re^analysis of the Vestinghouse data. The findings of the reanalysis 
are comparable vith those of Hestinghouse study but differ because of 
the changes in the structure of the model. The re*analysis suggests 
that Head Start produces statistically significant cognitive benefits 
for vhite children from mother*headed families and minority children; 
there is no evidence from the data to suggest that these benefits are 
permanent. (Author/JH) 
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ABSTRACT 



Head Start is a federally-funded preschool program for disadvantaged 
.:hlldren. In this paper the problems of evaluating Head Start from an 
eco.iofric viewpoint are considered and a rean^.lysis of the data collected 
for the 1969 Westinghouse Learning Corporation-Ohio University study is 
carried out. An evaluation of Head Start is difficult because of concep- 
tual and statistical problems. The conceptual problems deal with the 
difficulty in translating the benefits of Head Start into dollar terms 
so that it can be determined if the benefits exceed the costs and if Head 
Start is the most efficient social action program. Statistical problems 
arise because it is rarely possible to measure all of the independent 
variables in the model and some of the independent variables may be 
measured with error; the paper discusses ways in which these specification 
errors can lead to biased estimates of the effects of Head Start. 

Regression analysis is used in the reanalysis of the Westinghouse 
data. The statistical mode? is similar to the one used in the original 
study, but several modifications have been made. The findings of the 
reanalysis are compatable with those of the Westinghouse study but differ 
because of the changes in the structure of the model. The reanalysis 
suf^gests that Head Start produces statistically significant cognitive 
benefits for white children from mother-headed families and minority 
ct ildren; there is no evidence from the data to suggest that these 
benefits are permanent. 



EVALUATING PROJECT HEAD START 



1. Head Start as a Cocial Action Program of Interest to Economists 

In recent years grea: efforts have been made to integrate the dis- 
-advantaged into the mainstream of the American economy. Programs have 
been developed to retrain workers with o'^soletf: skills and to aid youths 
with less marketable skillfw Wc.or i t*^ t ;>iayed a major role, both 
devising programs of this natv i Wi.g , :he Job Corps) lad evaluating 
them. While economists were emphasising the r-ols of training adults and 
teenagers already in the labor force, educators and psychologists were 
stressir - the need for intervention at a much earlier stage In life — from 
the ages of three to five. Throughout the past decade nany such preschool 
progra s for disadvantaged children have been implemented, vith the largest 
in "^ze being the federally-funded Head Start program. In this paper how 
Head Start can be evaluated from an economic viewpoint Is considered and 
the conceptua' and empiric?! problems encountered in such an evaluation 
described. A reanalysis of the data collected in 1969 for :he first and 
only national evali )On of hVad Stert is then prt?3iv::ed. 

Head Start is . natior,;i preschool program whose purpose has been 
CO prepare chilclrr i from dasadvantagea backgrounds for entrance Into formal 
education in the primary grades. The {inilosophy underlying the program . .. 
that 01*- rea£on children from disadt/ania^-ed backgrounds perform poorly in 
ychool, and hence drop out and rewc ir) inoover ished , is that the home 
environment dons not provide the stimulation and amenitl s found in middle- 
class homes. By intervening between the ages of three aivi live, the program 
seeks to give these children a ''head start" in their attitudes and cognitive 



devij^ ^i^n f It » and thu3 break out of the cycle of poverty. Although origi- 
nslly prepared as an experimental pilot program in 1965, Head Start was 
greeted with such outstanding popularity that the funding for the first 
year was increased from $17 million to $103 million. The program 
continued to grow, both in popularity and in size, and it remains as 
one of the few remnants of the Johnson' Administration's antipoverty pro- 
grams to retain widespread support. 

Because Head Start has been established as a decentralized program, 
individu.a] reacers differ in their structure and ciirr iculum. In addition 
to attem t^ng to raise the level of cognitive devel:pment. Head Start 
progra'^T have tried to jrprove the physical and mental healf ; of the 
parti cJl^iantr , rrncoui ige parents to take an active role in ' revelopm^nr 
of the chilarcn, and improve the sense of dignity and self-worth of the 
Head Start participants. All of these components are import.- - ispects 
o^ Heed Stsrt, but Head Start differs from day care in that it cten.; » 
CO vaise the participants' cognitive development. Because of thiv ^nd 
bejcause the cognitive benefits are easier to measure than the psycho- 
logical and health benefits, this paper is reRtr-'ctel to evaluating the 
togiitive benefits of Head Start. ^ 

Although the evaluation of educational programs has traditionally 
been in the realm of psychology, in rpcent years economists have become 
Tiiteresr.ed in the field. Like on-the-job training and manpower training 
programs, education can be viewed as an investment in human capital; if 
th'i program is successful, the participant will be able to use his human 
capital to increase hi<5- earnings. Economists have generally limited their 
attention to the evaluation of "final" investment'", in hunan capital such 



as higher education and .nanpowor training. Regardless of the effectiveness 
of such programs, the increasingly popular belief that preschool programs 
are an important instrument for helping disadvantaged children requires 
that efforts be made to evaluate programs such as Head Start, 

There are two reasons why economists have done little research in 
evaluating preschool programs, ^irst, such efforts have traditionally 
been carried out by psychologists. If, however, limited funds are 
available for social action programs, decisions must be made on how to 
allocate these funds between preschool children and adults in the labor 
force. It is the economist who is best able to determine the relative 
efficiencies of L^ese two approaches. Second, it is very difficult to 
measure the benefits of preschool programs in economic terms. When a 
person participates in a manpower training program, the benefits can be 
measured as the difference in the present values of what the participant 
earns after the program minus what he would have earned had he not 
participated; although some problems exist in making these measurements, 
re s^nable evaluations have taken place. For preschool education programs, 
now^ver, there is a long lag between the termination of the program and 
the commencement of the monetary returns to the program. All that can be 
measured at the copcIh' ion oT the program are tbo benefits in terns of 
cognitive development. To perform a cost-benefit analysis, benefits 
must be converted into monetary gains that will accrue to the participants 
(and the rest of society) at least eight years after the completion of 
the program. We have obviously not reached the stage wher"* such cost- 
benefit analyses can be carried out, but we can summarize the progress 
that has been made and note areas that will require additional c^search. 
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2. Problems in Evaluating Project Head Start 

To evaluate a social action program such as Head Start it is neces- 
sary to measure both its costs and benefit's. The costs can be determined 
in a straightforward manner, and need not concern us here. As we have 
noted previously, the benefits are much more difficult to ascertain. 
Problems in measuring the benefits of Head Start can be divided into 
conceptual problems and empirical problems, and each of these areas is 
discussed below. 

The conceptual problems are those that arise in developing the 
theoretical model for the evaluation. The first step in the formulation 
of such a model is the determination of what we wish to measure as the 
program's output. The immediate products of Head Start may include 
cognitive and psyo.hological benefits of the program, daycare benefits 
to the parents of the participants, and medical and nutritional benefits 
to the children. These benefits may, however (with the possible excep- 
tion of the day-care benefits), be considered only intermediate goods 
which serve as inputs in the production of the output of ultimate interest 
to economists — earning ability* 

For an evaluation of Head Start we can divide the procedure into four 
steps: (1) the determination of our best estimate of the effect of Head 
Starr, on the cognitive development of the participants fo'" various groups 
of children; (2) the consideration of how confident we can be in t?, j 
accuracy of the es timates; (3) the determination of whether * i*>f btrt its 
of the program exceed the costs; and, (4) a comparison of He*.-- -"art to 
policy alter vitives to determine if Head Start is the most efficient 
program for meeting our goals. If the first and second steps Indicate 



a zero or negative effect of Hrad Start, the subsequent steps are unneces- 
sary and Head Start can be viewed as an ineffective program. 

To ^.stimate the effects of Head Start, a model must be specified that 
includes Head Start as one of the inputs. Because the dependent* variable 
in the model is the cognitive development of the child, the model can be 
viewed as an educational production function where we attempt to relate 
the various input factors supplied by the child, his family and society 
to the output of education. Research concerning educational production 
functions is still in the early stages, and there are problems in deter- 
mining which variables should be included and what functional form should 
be used. To simplify the discussion, it is assumed that a linear functional 
form is appropriate (at least for the range of the available data), but it 
should be noted that substantial research remains to be dora in this area. 
Head Start can be considered as either a binary variable (where a child 
either participates in the program or does not) or as a continuous vari- 
able (where the length of the treatment can vary by increasing either the 
hours per day spent in Head Start or the number of weeks that the program 
lasts). For this illustration it is assumed that Head Start is a continu- 
ous variable. Data must then be collected on the variables in the 
educational production function for a group of children. For reasons 
discussed below, the nature of the data collected will be important in 
determining if the data will produce unbiased estimates of the coefficients 
in the education- 1 production function. The dependent variable in the isodel 
is some measure of cognitive development taken after the completion of the 
Head Start program. Assuming that the model has been properly specified, 
the cognit: measu/e may be regressed on the set of independent variables, 
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and the regression coefficient for the Head Start variable will be the 
best estimate of the effect of one unit of Head Start on the cognitive 
measu*.e used as the dependent variable. If the coefficient of Head Start 
is zero or negative, our work is done — it may be concluded that in Its 
present form Head Start is an inappropriate program for increasing the 
cognitive development of the participants. However, if the effect of 
Head Start is positive, the statistical significance of the coefficient 
must be considered. If the hypothesis of a zero effect cannot be rejected 
at the 1 or 5 percent significance level it is sometimes argued that the 
program should be abandoned because there is no "significant" effect. This 
argument is rarely correct. As Cain and Watts (1970, p. 233) explain, "a 
body of data may be unable to reject the hypothesis that some coefficient 
is zero and be equally consistent with a hypothesis embodying a miraculously 
high effect." It must be kept in mind that the regression coefficiert is 
the best estimate of the treatment effect and that a large standard error 
for the coefficient only implies that we cannot be very confident in the 
accuracy of the estimate. If the regression indicates the Head Start 
coefficient is not significant but that the coefficient is large enough 
so that the effect of the program appears to be worth attaining, a more 
refined evaluation of Head Start should be attcrupted. Some of the tech- 
niques that can be used to increase the efficiency of the estimate include: 
increasing the sample size, improving the accuracy of the ^neasurement of 
variables in the model, and selecting the sample so ♦'hat thc^re is less 
covariation between Head Start and the other independent v.," 'ables. For 
policy decisions the choice of when to dismiss a coefficient as being 
insignificant should be based on the costs of making an incorrect decision 
rather than o»i so.:e arbitrarily selected significance level. 



A Head Start instructor may be satisfied to learn only what the 
cognitive benefits of H».ad Start are, but a policymaker must know more. 
The policymaker must know whether the*, benefits of Head Start exceed the 
costs of the program so he can determine if the program is worth funding. 
The relevant question is not "Does the program work?" but rather, "Is 
the program worth funding?" This part of the evaluation requires placing 
a dollar value on cognitive gains — a very difficult task, but one which 
must be faced. If it costs $1,500 to make Head Start available to one 
child, then the policymaker must decide if the gains produced by Head Start 
are worth at least that much for nhe program to be judged worthwhile. For 
private goods, economic theory asserts that an individual will purchase a 
good only if the benefits he receives exceed the costs. For publicly 
funded social action programs, howe. >r, it becomes much more difficult to 
price the benefits, especially when the benefits are in the form of gains 
in cognitive development for young children. The benefits of the program 
would not necessarily be the discounted increase in earning power due to 
the change in IQ that occurred because of Head Start. If the benefits of 
Head Start fade after the program, then such a procedure wculd overstate 
tha value of the gains. It is also possible that the gains from Head Start 
would allow the child to be placed in a higher "track" in the primary grades, 
thus leading to additional gains. Finally, the externality benefits to the 
taxpaying public and the day-care benefits to the child's parents should 
*e included in the benefits. 

After the benefits of Head Start are expressed in dollar terns, the 
benefits can be compared with the costs of the program. If the costs 
exceed the benefits, then Head Start should not be continued in its present 
form even if it produces significant benefits. For example, we would reject 
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a manpower training program that increased the percent value of an indi- 
vidual's lifetime earnings by $10,000 if the program cost iJlS.OOO per 
participant. ^ 

The final step in the evaluation is to compare Head Start with alter- 
native policies. To illustrate this point we shall us2 the following 
hypothetical example: assume that the educational production function 
included three policy variables : Head Start (Z) , kindergarten (K) , and 
cash transfers (I). Further assume that Heari Start and kindergarten are 
continuous variables. Children can receive any combination of the policy 
variables, but it is assumed that the relationship between the cognitive 
te«3t score (Y) and all independent variables is linear and additive with 
no interaction effects. 

The first step in the analysis is to regress the cognitive measure 
Y on the three policy variables and all other independent variables 
specified by the model. Suppose that the fitted regression equation is: 

N 

Y = + .00051 + l.OOZ + 5.00K + Z 6 X 

^ i=l 

where the X variables represent the nonpolicy variables such as age, race, 
and Gex. Because the regression coefficient for Head Start is positive, 
we can proceed to compare the costs and benefits of the program. For this 
example assume that one unit of Head Start costs $1,000 per child. If the 
policymaker decides that a gain of one point on the dependent variable and 
the other gains from the program are worth at least $1,000 then the program 
can be judf^ed a success. The policymaker can then compare Head Start with 
the policy alternatives to see which is the most effective method of increas 
ing the cognitive development of disdvantaged children. Suppose that t>ie 
following cost information is available: 
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cost of a unit of Head Start = $1,000 

cost of a unit of kindergarten « $2,000 

cost of income transfer - $1 

By combining the cost information »;ith the regression coefficients we can 
arrive at the cost-benefit ratios ljt the three policies: 

pcints/dollar for Head St-^rt « .0010 

points/dollar for kindergarten = .0025 

Doints/dollar for vicome transfer = .000:) 

Thus io.. the example presenttid here, Hear' Start is . ot the most efficient 

way of increasing the cognitive development of children; kindergarten 

2 

provides more benefits per dollar of expenditure. 

It is considerably more difficult to compare Head Start with social 
action programs for those already in the labor force. For such compari- 
sons it is necessary to either select som, tradeoff between cognitive gains 
for preschool children and earning oower for adults or extrapolate the 
cognitive gar'ns ano determine how they affect earning power. As most 
evaluations of prt:school programs have found that the gains disappear 
within three years, using the secona criteria would lead to the abandon- 
ment of preschool programs as a means of increasing earning power. 

The procedure for determining the appropriate policy instrument 
becomes more complex if externalities are considered or if Head Start 
and kindergarten can be provided at one treatment level. When external- 
ities are present, the program that is most efficient tor increasing 
cognitive development or earning power may not be as politically accept- 
able as an alternative policy. Thus, even if a manpower training program 
were more effective than Head Stait, taxpayers might prefer that their 
money be spent on children rather than on adu js. Indeed, the only 
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justifications for in-kind transfers are paternalism and certain types 
of externalities. If Head Start and kindergarten are available only 
in one quantity (where a child either participates in a program or he 
does not, but the duration of the program is fixed) then there is an 
interesting equity-efficiency tradeoff. In the example presented above, 
the largest aggregate amount of benefits would be produced by concentrat- 
ing all of the funds on kindergarten; yet this procedure will provide 
benefits to a smaller number of children and may be criticized on equity 
grounds. The policynakrr must then decide how to weight the equity and 
efficiency aspects * ir.aking his decision. 

The forn;ulatior ♦ conceptual model offers no guarantee that an 
uaDiased evaluatior ^r^n be carried out. For evaluations of programs 
'•u:h as Head Start it is generally not possible to directly test the 
theoretical model. Specification errors occur because one or more of 
che indeptndert variables are undesirable or are measured with error. 

To illu.strate thri/d o^oblems several simnle models will be developed; 
a more comp .ete discussion of these models and several additional models 
can be foun^' in tat. works by Barnow (1972 and 1973) and Goldberger (1972a 
and 1972b) The underlying mcdel considered is one where a child's cog- 
nitive developii'-nt in period 2 is a linear function of his cognitive 
devel -ne'it n pf»/ od 1 and the gains he receives from participating in 
Head '.(an. To sirplify the analysis Head Start is assumed to be avail- 
able a"" only one le/el, and the dummy variable in Head Start, Z, is 
definec so that Z=l if a child has participafed in Head Start and Z=0 
if he has nol. Cognitive development in period 1, prior to Head Start 
experience, is represented in the model as X*, and measured cognitive 
development in period 2, after Head Start experience, is denoted as Y. 
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It is important to note that the variable Y is i fallible measure of 
cognitive development but that X* contains no measurement error. (In 
the psychological literc.*^-r^ X* is referred lj as a true score.) To 
complete the iiodel, the disturbance terni associated with the posttest 
is denoted ao v. The model may now written as: 

Y = 6^ + B^X* + B^Z + V. (1) 

It is necessary to include a control group of children who have not 
oarticipated in Head Start to estimate B^* vhic i is the treatment effect. 
If we simply regressed Y on X* for children wfo have ' r Head Start * v: 
woulc? h jnable to differentiate* between normal cognitive grow; x ac the 
effe* o'' Head Start; this problem would be f»rT^iCi'*l y gre " if treat- 
ment is of long duration. 

Lr'^ortunately it is rarel^^ ^.ossibTd to directly estimate *»quation 
(1). The level nf co^pii development prior to the treatment is usually 
unav lilable or availab v measareir nt error. Test< jhat can be used 
to "measure pretreat m . ti-'e dev» opment may provide ar unbiased 

measure: oi X*, but the ai subjr to error. Denoting a f.*llible but 
unbiased measure of ; tcreatmen : ultive development as X we may write 

X = X* 4- . (2) 

where u is assume to be normally distributed with a zero mean and uncor- 
:*eiated with /* ..e variabj '< is generally referred to as a pretest and 
Y h:, referred co vs a pos^^oPt, It will also be assumed that u is indepen- 
dent of V, X > T. Z; more formally this can be stated as: 
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Cov(u,v) = Cov(u,X*) = Cov(u.Z) Cov(v,X*) = Cov(v,Z) = 0 (3) 

Assuming that X and not X* is available for ar ^.valuation, we are inter- 
ested if the regression coefficient of Z will be the same when the linear 

3 

regression of Y on X and Z is run rather than Y on X* and Z. Thus when 
we determine 

E(Y|X,Z) = + a^X + a^Z (4) 

we wish to discover if the a coefficients are equal to the. corresponding 
6's. It can easily be demonstrated that, in general, errors in the measure- 
ment ot one of the regressors leads to bias of all regression coefficients. 
For an evaluation of Head Start it is especially important to learn the 
relationship between and B^* First we make the fallowing definitions: 

0^^ = Var(X*), = Var(Z), a^^ = Cov(X*,Z), a^^ = Var(u), (5) 

. 2 ^^*Z>^ 
F = 7 , r = . 

** UU ** ZZ 

Note that the parameter P is the ratio of the variance of the true measure 

of pretreatment cognitive development to the variance of the pretest and 

2 

that 0£P£l. The parameter r is the squared coefficient of correlation 
between X* and Z. When the normal equations for (4) are solved, it can 
be shown that: 

There is no bias in equation (6) only under certain conditions. One example 
is if P=l then ^^^^^y^* ^^ut this requires that X*=X and that there is no measure- 
ment error. Another is when 6^=0; i.e., tha variable measured with error 
does not belong in the regression. This case is theoretically untenable 
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because it implies that a child's cognitive development in the second 

period has no relationship to his cognitive development in the first 

period. A more interesting case is that a._=0 suffices for a„=6,. 

Z Z 

But cr^^=0 is equivalent to E(X^] ')=E(X*), which says that the mean of 
pretreatment cognitive devel^^pment is the same in the experimental and 
control grouj^s. inJor\ seleition and matching on pretests would lead 
to this result , J refore avoid bias. Note that if cr^2<0 (the experi- 
mental group initial a-q lower cognitive development) then ot«<6„ and 
the bias ould t.- to underestimate the treatment effect. 

Thfj ,*>rimary lesson to be learned from this exercise is that when 
one of the regressors is measured with error the estimate of the treatment 
effect may be biased* If the Head Start administrators used a "scraping" 
procedure whereby children with the lowest levels of true cognitive,, 
development were assigned to the experimental group and the central group 
selected from the more able children, regression analysis would lead to 
an underestimate of the true effect of Head Start. Random assignment 
to the experimental and control groups is the most appealing way to avoid 
bias in evaluations, but random assignment is not always possible. In 
many instances an evaluation must be carried out ex post facto or certain 
criteria must be used for admitting children to the experimental group. 

The < /.a which we analyze in the next section of this paper were 
collected for an ex post facto evaluation of Head Start. Although no 
direct measure of pretreatment cognitive development is available, when 
certain assumptions are made the socioeccnomic and demographic dati 
collected at the time of the posttest can be used as a proxy of pretreat- 
ment cognitive development. The following are the assumptions that nust 
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be made to utilize the socioeconomic and demographic variables: (1) the 
variables must be a function of cognitive development, (2) we must know 
how to specify the relationship between the set of independent variables 
available and cognitive development, and (3) exposure to Head Start does 
not affect the socioeconomic status of a child. If we make these assump- 
tions then the information collected on the background of the Head Start 
and control children can be used as a fallible measure of pretre?t:aent 
cognitive c^evelopment in statistical analyses. 

An unfortunate drawback of ex post facto analyses is that unless 
the process by which children were assigned to the treatment groups is 
known, it i& 'mpossible to determine if regression analysis will produce 
unbiased estimates of treatment effect; it may even be impossible to 
discuss the direction of the bias. For the errors-in-variables model 
developed abovti, discr iminart analysis can be used to test the hypothesis 
that the Head Start and contro] g: oups differ significantly on a set of 
variables thnt - s associated rit^ cogrxtive development. Hov/ever, that 
model is consistt^nt only 'ith ^ ort< «ection procedires — such as when 

selection ir jaserl on true scores whe- ;vi'''^tion Is ased on one set 
of variables nd ai:othor set if> available f"i the evalur? . ioi; - and other 
t^election proc»:dure:i may not Irad to biased estimates when the Head Start 
c.nc coiJ:rol groups differ* in their pretreatment cognitive development. 

For example^ if all eligible children were ranked on some socioeconomic 
xneastLi^ 3nd t 9 child!% n in he lower half of the disliibution were entered 
ir Head Start while the upper halt were then placed in the control group, 
the control ^ '.)up would have a higher mean level of pretreatment cognitive 
development t) un the Hei.d Start group. But it has been demonstrated by 
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Goldberger (1972a) and Barnow (1972) that this selection procedure will 
not lead to biased estimates of the Head Start effectb when the posttesL 
score is regressed on the selection variable (X) and the dummy variab t** 
for Head Start. The reason that the errors-in-variablrs model described 
above does not apply when this selection procedure is used is *;hat the 
assumption that Cov(u,Z)=0 is violated. Several additional selection 
procedures are analyzed by Barnow (19*3) and Goldberger (1972a), and it 
is demonstrated that one must know the selection procedure used before 
any conclusion can be drawn on whether a given uata set will produce 
biased estimates of the effects of Head Start. 

3. Estimating the Cognitive Benefits of Head Start 

By 1969, Head Start had been in existence for three years, and 
officials at the Office of Economic Opportunity felt that the program 
was ready to be evaluated. Several long-term evaluations were planned 
including a seven-year longitudinal study and a planned v^rietion study. 
However, these evaluatiors would not provide usetal infomation for several 
years so \ ./itract was awarded to the Westinghouse uearning Corporation 
in conjuncLiCLi with Ohio University to carry out a one-year ex post facto 
evaluation of .^,ummtr aad full year Head S,.c.rt programs throughout the 
nation* Although the V?estingnouse rese?4rrhers attempted to determina 
the affective as well ay the cognicive benefits of Head Starts in this 
paper attention is restricted to the Istut^? becausr I feel that not only 
are cognif.ive aspects more important but there is little evidence to 

Indicate-^ that the affective tests designed lor the Wastin^h-^use evaluation 
4 

are satisfactory. 
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To carry out the study the Westinghouse researchers randomly selected 
300 Head Start centers from a list of all centers • From this sample 225 
were s rreened until 104 centers (29 full-year centers and 75 summer centers) 
-pr? found that could participate in the evaluation; although Smith and 
3i Sti'l (1970) argue that the dropped centers were the most eift^ctive ones, 
taey offer no .strong evidence to support this view. From each center a 
rand 'm sample of eight former Head Start participant: /ho wore in each 
of rhe first, second, and third grades at the time )f evaluation were 
selected. The Westinghouse researchers recogni^ea the weaknesses of ex 
post facco evaluations and established the following criteria for consider- 
ing children for the Head Start or control populations: 

(1) Continuity of residence in the target ar^a. All subjects 
mi St have lived in the target area from the time of the 
specified Head Start program until the time of the study. 

(2) Eligibility for Head Start. All subjects must have met the 
eligibility requirements for participating in the Head Start 
program. 

(3) Equivalent school experience. All must have attended the 
same school system. 

(4) No other Head Start experience. (Cicirelli et al» , p. 36. 
For the remainder of the paper all page references refer 
to ricirclli ec a,i., unless otherwise specified.) 

Within each ot the 104 neigliuorhoods a control sample itf. ei^ it children 
at each grade V."»v was then selected. In addition 1 fulijUing the 
four criteria listed above, the control children were tielcccej. to mtch 
the Head Start sample on the basis of age, se: kinc^ir ;artan attendance, 
ana social/ethnic characteristics. The groups ero ot matched in socio- 
economic status (SES) , but informatiofi on SES ^as .allected and osed a3 
i. covariate in the statistical ana'ys.s. 1 -^i vr' < :ion on the SES of the 
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children was collected by interviewing their parents. In addition, 
children in the final sample were given a series of cognitive and 
aftoctr/e tests. 

Tho primary method of .statistical analysis used by the WestiaghDuse 
researchers was analysis of covariance. The equivalent regression equa- 
tion for their analysis is 

N 

o Z 1 1 ^ ^ 

where Che variables ^ re (it»fined as: 

Y - tht score on the cognitive test 

Z = a iummy variable for experimental status where Z « 1 

for children who participated in Head Start and Z » 0 
fc. ::he children in the control group 

Xj^ = t '3 measure of SES used in the study 

X2*..X^ " ^ series of dummy variables for the N neighborhoods or 
i ar^ei areas where Xj = 1 if the child is from neigbor-* 
:oo(: ] and X j = 0 otherwise 

u = a dir3c;jrbance term assumed to be normally distributed 

a mean of zero and constant variance » and assumed 
to oa uncorrelated with the independent variables. 

The Westinghouse researchers felt that the data should be grouped by neigh- 
borhood rather than use individual children as the unit of observation; 
for each neighborhood there were thus two observations — one for the Head 
Start children and one for the control children. The mean value for each 
grouo on each variable wd3 determined and used for the values for that 
observation* 

The variable wjed tr cMitrol for SES in the Westinghouse scudy is the 
**Two-Factor Index of Soci 1 ^^osition" described in Hollingshead (1958). The 
Hollingshead Indrx is rx vo-ighced sum of the head of household's occupation 
and educational i- linment where both factors have been divided into a 
seven^point scale. 
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The children in the samples? were given both an ability (int elligence^ 
test and an achievement test. The ability test ised in the study is the 
revised edition of the IV li iois Tes^: of Psycholinguistic Abilities (ITPA). 
The object of the ITPA, aw ording to Kirk ec al, (1970, p. 5) is to 
"delineate specific abiliticc and disabilities in children in order t.iat 
remediation be undertaken when needed. It is a diagnostic test of specific 
cognitive abilities as well as a molar test of intelligence." The ITPA is 
strucfzurcci so that it is applicable for children from the ages of two to 
ten, and it was administered to the children in all three grades. The 
achievement tests that the children took are the Metropolitan Readiness 
Test . Stanford Achievement Test Primarv I Battery , and the Stanford Achieve- 
ment Test Primary II Battery for the first, second, ar-i third grades, 
respectively. 

Each of the cognitive test scores was used as a dependent variable 

for full-year and summer samples for the first, second, and third grades. 

In addition, the samples were further stratified by peop,raphic region> 

racial/ethnic characteristics of the centers, and by t3rpe of popxilziion 

units in which the centers were located. Most of the analyses conducted 

by the Westinghouse researchers indicated a zero or negative effect for 

summer Head Start and a small positive effect for full-year Head Start. 

The study concludes that: 

In summary, when one lookb at the observed effects of Head Start 
according to the test of practical relevance, it must be concluded 
that the effects found on the standardized tests are indeed small 
in magnitude, with the exception of a few differences found in sub- 
groups of full-year centers on the ITPA, and do not meet the 
criterion of practical relevance (p. 168). 

The Westinghouse report concludes that summer Head Start programs should 

be phased out as soon as possible and replaced by full-year programs, and 

that attempts she:: ' ! be made to strengthen the full-year programs. 
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T' v\ )ps because of the ex post facto design, and to some degree 
beciuirc cf tue unpopular nature oi the finding?, the Westinghouse study 
bar. htt^^n the subject of many criticisms. Alilscugh some critics have 
argued ti. r- the questions addressed by the study are incorrect and that 
the cognitive tests used may be inappropriate, the most serious criticisms 
deal wlch the selection of the Head Start and control children for the 
study and whether or not the estimate of the effects of Head Start found 
in the r^tudy are biased. Although the crucial question of whether or 
not thr Westinghouse data can be used to prc<iuce unbiased estimates of 
the effects of Head Start can neveA be answered, some critics feel that 
the V stiDghouse researchers did not analyze the data in the best way 
a) d * hat a reanalysis would produce a better estimate; indeed, the results 
( ^ .ir reanalysis are consistent with the Westinghouse findings but demon- 
sti<j;Ce that Head Start may be -rfective for certain groups of children. 

To improve the statistic*^], analysis of the Westinghouse data, we 
have uiade several ctauges in the regression equations used. Whatever 
the nc'^its of che Hollingshrad Index are ii t ensuring status, there is 
no reason for including socxoeconomic infom.ation in that form for a 
model of cognitive develcpnt'/iC. Althoagh the information that is used 
in computing t'^e Holl Incsh- i '. Index can be included jn r' ,e model either 
tr. r.er' j rs .-^ m »c> for oretreatment cognitive development or as indepen- 

'4 

dent .^tria ;il^^ in thei" owr^ right, the information caii -etter be Inclided 
as s* .J of dummy itiatw*: for two reasons: (1) the hoxlingshea' I:ict: 
con? Mains tae cc tfi.^f ut for educational achievement to be f our-s?veiths 
of t ^ coefficlen: for occupational status, whereas including the v^ iables 
indepc'.* y permits the coefficients to vary freely;^ and (2) when Ihe 
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information is entered as precoded variables the effect of moving from 
any one step to the next highest is constrained to be the same, but the 
use of dummy variables allows the effect to vary from step to step. As 
knowledge concerning the relationship between cognitive development and 
SES is very limited, one should permit the functional form to be as 
general as possible. 

Variables not included in the Hollingshead Inde\ are ilso appropriate 
for inci jsion in the regression model. By expanding ch^i , ^f independent 
variablris we expect to reduce the possible bias in t coeii ci^.nt for Head 
Start cai sed by nonrandom treatment assignment. In adJition, the varL<ibles 
that we have included are useful in helping us to learn more about thc5 
educational production process, and several of the variables can be used 
as policy variables in addition to or in place of Head Start. A list of 
the variables employed in the reanalysis and a description of how the 
variables were formed is given in Table 1. When a single trait, such as 
mother's education, is represented by a group of dummy variables (i.e., 
MSOCOL, MHSG, MSOHS, M79, and M06) we have generally followed the practice 
of omitting the variable for the highest category from the regressions. 

A second change that ha5 been made for the reanalysis is that individual 
rather than grouping data ar^^ used. Cramer (1964) has demonstrated that group 
ing leads to a loss of efficiency, and Blalock (1961, pp. 102-112) shows that 
grouping can sometimes bias the regression coefficients. For these reasons 
we have used individual date and thereby increased the number of observations 
by a factor of eight. 

"he final change in the method of analysis that we have employed is 
that ve have stratified the data differently than the Westiaghouse researchers 
did : X parmlt Head Start to have a different effect for various classes of 
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TABLE 1 

Description of Variables Used In the Peaiialysis 

Variable Description 

CHILD Number of children in the family 

INCOME Total annual incone of the child's family in dollars 

AGE Age of the child to the nearest year 



MSOCOL 


1 


if child's 


mother 


has more than 12 years of education; 




n 
\j 


C: L ncL W LOC 






MHSC 


1 


if child's 


mother 


has 12 years of education; 0 otherwise 


MSOHS 


i 


if child's 


mother 


has 10-11 years of education; 0 otherwise 


H79 


1 


if child's 


mother 


has 7-9 years of education; 0 otherwise 


M06 


1 


if child's 


mother 


has 0-6 years of education; 0 otherwise 


MOPRO 


1 


if child's 


mother 


has professional or managerial occupation; 




0 


otherwj.S€2 








1 


if chllr's 


mother 


has clerical occupation; 0 otherwise 


MOSKIL 




xi ch'.:. ;'s 


mother 


has skilled occupation; 0 otherwise 


MOSEMI 


1 


if child's 


mother 


has semiskilled occupation; 0 otherwise 


MO'JNSK 


1 


if child's 


mother 


has unskilled occupation or no occupation; 




r 


ethe r* ise 






FEK.\LE 


1 


■..f cai\:\ 'S female; 0 otherwise 


MALE 


i 


if child is male; 


0 otherwise 


RUKAL 


T 

X 


if child lives in 


a rural area; 0 otherwise 


KIND 


1 


if child attended 


kindergarten; 0 otherwise 


IND 




if child did not i 


ittend kindergarten; 0 otherwise 


FSOCOL 


1 


it child's 


father 


has more taan 12 years of education; 




0 


otherwise 






FHSG 


1 


if child's 


father 


has 12 years of education; 0 otherwise 


FSOHS 


1 


if child's 


father 


has 10-11 years of education 0 otherwise 



TABLE 1 (continued) 
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Variable Description 

779 1 if child's father ha3 7-9 years of educa*^ion; 0 otherwise 

F06 1 if child' ; father has 0-6 years of education; 0 otherwise 

FAPRO 1 if child's father has professional or managerial occupation; 
0 otherwise 

FACLER 1 If child's father has clerical occupation; 0 othervise 

FASKIL 1 if child's father has skilled occupation; 0 otherwise 

FASEMI 1 if child's father has semiskilled occupation; 0 otherwise 

FAUNSK 1 if child's father has unskilled occupation; v otherwise 

WHITE 1 if child is whine; 0 otherwise 

BLACK 1 if child is black; 0 otherwise 

MEXAM 1 if child is Mexican- American; C otherwise 

HDSTRT I if child has had Head Start; 0 otl erwise 

BLKHS 1 if child is black and has had Head f'tart; 0 otherwise 

ITPAMN mean of child's nonzero scores or lTVf\ 

MRTMN mean of child's nonzero scores on KP.T 

SAT2MN mean of child's nonzero scores on SAV2 

SAT3MN mean of child's nonzero set res on //AT3 

DIVOR 1 if child's parents divorcod; 0 o*">^'*tuise 

SEPAR 1 if child's parents separated; 0 otherwise 

WIDOW 1 if child's mother is a widow; 0 otherwise 

NEVMAR 1 if child's mother never m;. ried; 0 otherwlue 
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children. We have stratified by type of program (full-year and summer), 
grade (first, 'Second, and third), and by parents present (both and mother 
only). The stratification by parents present is necessary because the 
set rf independfpi variables is different when only the mother is present; 
the variables for occupation and education of the father must be dropped 
and we have aJded variables for the marital status of the mother. There 
are ten rather than 'velve samples because there were too few third grade 
full-year children Co analyze. Stratification by race was attempted, but 
it was discovered that only the coefficients for Head Start and kindergarten 
differed significantly for Blacks and whites so interaction variables were 
added for the final analysis Instead of stratifying by race. 

The complete set of regression results from the reanaiysis can be 
found in Baxnow ''1973). ]n the present paper some of the important r>?sults 
are reproduced :;nd the major findings are summarized. In Table 2 and Table 
3 the means anc standard deviations are presented for the first grade, 
summer and fu] t-year, both parents present samples. The tables indicate 
that the child^Tisn in the sample are indeed from disadvantaged backgrounds. 
The average n^-mber of children per family is as high as 4.8 and annual 
family incorii is as low as $4,861. Although both the Head Start and control 
children co:he rom disadvantaged families, the Head Start children are on 
averagf: more sad antaged as is indicated by income and family size. Note 
also that the xe^ns of the cognitive tests (ITPAMN and MRTMN) are higher 
for the control groups than for the Head Start groups. Thus, if we regressed 
cognitive ti ^'elopment on treatment status alone we would find a negative 
effect for Head Start. 
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TABLE 2 





Means and Standard Deviation for 
Summer, Both Parents Present 


Grade 1, 
Sample 










Head Start 








Control 


Variable 


Mean 


Standard 


Deviation 




Mean 


Standard Deviation 


CHILD 


4.69 




2.10 




4.12 


2.04 


INCOME 


5049. 


2514. 


5859. 


2986. 


AGE 


5.89 




.446 




5.95 


. 548 


MHSG 


.263 




.441 




.416 


.494 


MSOHS 


.362 




.481 




.237 


.426 


M79 


.194 




.396 




.192 


.394 


M06 


.092 




.290 




.065 


. 247 


MOCLER 


.109 




.137 




.042 


.201 


MOSKIL 


.032 




.176 




.023 


.149 


MOSEMI 


.130 




.337 




.078 


.268 


MOUNSK 


.794 




.405 




.834 


.y,2 


FEMALE 


.511 




.501 




.490 


.501 


RURAL 


.263 




.441 




.234 


.^24 


SO 


.225 




.419 




.295 


.457 


FSOHS 


. 222 




.416 




.286 


.452 


F79 


.263 




.441 




.175 


.381 


F06 


.165 




.372 




.123 


.329 


FACLER 


.041 




.199 




.045 


.209 


FASKIL 


.197 




.398 




.240 


.428 


FAS EMI 


.311 




.464 




.351 


.478 


FAUNSK 


.403 




.491 




.263 


.441 
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TABLE 2 (continued) 









H^Afi Start 




Control 


Variable 




Standard Deviation 


Mean 


Standard Deviation 


BLACK 


.289 


.454 


.247 


.432 


MEXAM 


.133 


.340 


.107 


.310 


KIND 


.625 


.485 


.614 


.488 


BLKIND 


.194 


.396 


.175 


.381 


ITPAMN 


19.13 


3.52 


19.30 


J. 4 


MRTMN 


9.03 


2.60 


9.31 


2.74 



N 



315 



308 
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TABLE 3 

Mea\:s and Standard Deviations for Grade 1, 
Fu] 1-Year, Both Parents Present Sample 





Variable 




Head Start 




Control 


Mean 


Standard Deviation 


Mean 


Standard Deviation 


CHILD 


4.83 


2.17 


4.74 


2.01 


INCOME 


4861. 


2252, 


5490. 


2656. 


AGE 


5.97 


.492 


5.96 


.420 


MHSG 


.385 


.489 


.304 


.462 


MSOHS 


.308 


.464 


.294 


.458 


M79 


.192 


.396 


.196 


.399 


M06 


.067 


.252 


.088 


.285 


MOCLER 


.019 


.138 


.039 


.195 


MOSKIL 


.019 


.138 


.029 


.170 


MOSEMI 


.173 


.380 


.157 


.365 


MOUNSK 


.769 


.423 


.735 


.443 


FEMALE 


.510 


.502 


. 539 


.501 


RURAL 


.212 


.410 


.245 


.432 


FHSG 


.308 


.464 


.304 


.462 


FSOHS 


.260 


.441 


. 196 


. 399 


F79 


.173 


.380 


.147 


.356 


F06 


.192 


.396 


.157 


.365 


FACLER 


0.00 


0.00 


.029 


.170 


FASKIL 


.240 


.429 


.284 


.453 


FASEMI 


.356 


.481 


.363 


.483 


fAUNSK 


.337 


.475 


.245 


.432 
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TABLE 3 (continued) 

Head Start Control 

Variable Mean Standa r d Deviation Mean Standard Deviation 

BUCK .5C0 .502 .480 .502 ' 

MEXAM 192 .396 .157 .365 

KIND .558 .499 .588 .495 

BLKIND .2iil. .455 .304 .462 

ITPAMN If.BA 3.84 19.66 4.26 

MR'niN 8.7/1 2.53 8.81 2.68 

N 104 102 
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The regression equations "or the first grade, suiiimer and full-v»ar, 
both parents present samples wirh the ITPA score as the dependert var .able 
are in Table A. Some of the coefficients did not have the expecled sign 
and require some explanation.. li was expected that in'iome would have a 
po 1 ,ive coefficient, but for the suinmer sample and some of the other 
sainp'^ei, ' )me \. ' -'/i t iMve I » ins jgnlf ant coefficient. This cou 
' ♦ % * 3LC\vt .he f luca!\ ^ ;.d occu ation va^'iables capture - f o^'cmanei ' 
inc( 'le titatu5 of \ i ' vs * imily and that measured incov)^ ivclud j^ 
transfer payments receive, the family; unfortunately, the Vootinghous : 
data do not break down the income information by source. 

The information for the occupation and education of the parents Is 
included in the regression equations as sets of dumtuy variabl« s with the 
highest categories used as the reference g'oip. For a nationally repre- 
sentative sample we would expect tLo coefficients within each set to 
become increasingly negative as we progress to lower levels of education 
and occTipation. However, the Westinghouse data are not representative 
of all families. Only children who were eligible for Head Start are in 
the samcle, and because Head Start sought to enroll disadvantaged childreni 
the parents with high levels of education (some college) and occupation 
(managers and professionals) must be atypical of the general population 
of these parents; parents wiLh sui h high levels of SES whose children 
are still eligible for Head Start may be considered as ''j[ailures*" The 
Implication of this sample truncation is that children v^hose parents are 
in the base groups for the educational and occupational variables may not 
only be expected to score lower on cognitive tests than children of parents 
with similar levels of attainment in the general popi^lation, but may even 
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Effects cf Individual Characteristics for Grade I, Both farents 
Present Scrnple^ on Child's ITPA Score, r.ummer and Full Year 





Indeper dent Var ia ble 


ouiTuiier 


ruii xear 


Qt LD 




-.355** 




(-1.900) 


(-2.720) 


INC ME 


UUUUi 






(-.17000) 


(3.2900) 


AOL 








(2,790) 


(2.52) 


MHSG 




— 9 Ql 




(-.350) 


(-.260) 


wo At I O 

MSOHS 




ceo 




(-.870) 


(-.480) 


WT ft 

M79 








(-1.150) 


^-.97) 


M06 








(-1.97) 


(-.330) 


MOCLEF 




o o c 
-2. 95 




(1.97) 


(-1.39) 


HOSKT • 


1 . OO 






(1.51) 


(-.400) 




1 CO 


1 1 "J 
-i. 1 J 




(.150) 


(-.70) 




. DUD 






(.520) 




r rIALb 


— i '1 


. J> OH 






(.750) 


RLRAL 


.240 


1.17 




(.700) 


(1.03) 


FTiSG 


. )41 


-1.73* 




(.690) 


(-1.90) 


FSOHS 


-.003 


-.950 




(-.010) 


(-.990) 


F79 


-.475 


-2.16* 




(-.890) 


(-1.99) 
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TABLE 4 (continued) 
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Summer 



Rill Year 



F06 




- .905 
(-1.450) 


-1.07 
(-.95) 


FACLER 




-.921 
(-1.080) 


1.25 
(.52) 


FASKIL 


\ 


-1.01 
(-1.63) 


.576 
(.490) 


FAS EMI 


-1.17* 

(••i>50) 


-.221 
(-.190) 


FAIJI^SK 




-1.64** 
(-2.57) 


-.040 
(-.030) 


hLA!. 




-2.10** 
(-3.11) 




MEXAM 




-1.12*- 
(-1.69; 


-.4 j9 
(-.330/ 


KIND 




.862** 
(2.500) 


3.4'** 
(3.-^4) 


BTJCTND 




-.593 
( -. 900) 


-3.56** 
(-3.11) 


uriQTPT 




-. 361 
(-1.000) 


-1.22 
(-1.38) 


'<'.CKHS 




1.99** 
(3.13) 


2.04* 
(1.79) 


Mt:XKS 




1.53* 
(1.78) 


.081 
(.050) 


CONSTANI 




16.67** 
(1.98) 


12.00** 
(2.82) 






.186 


.374 


N 




623 


206 



Note: t-statistics are in parentheses below their coefficients. 
*Statisti'^aily significant at the 10 percent level. 
♦^Statistically significaVit at the 5 percent level. 
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score *ower than children lr\ the Westlnghcui^i sample whose parents are 
in some the lower categories. 

Blacl and Mexican-America.i ^-aildren have been observed to score 
lo.*f.r than white children on m^ ny ability and achievement tests. We lack 
the expertise to determine if such findings Are due to real differences 
or c».. tural bias in the tests. It shou .cl noted that for the Westing- 
house data the Black and Mexican -America a children come from significantly 
more disadvantaged backgrounds th n the white children, and the negative 
coefficients found for minoril / )Jiroup children may be due in part from 
failure to adequately control for thesi differences. 

Head Start and kindergarten r.re the two most important policy 
variables in the r<-gression that can be used to affect cognitive develop- 
ment. To allov these variables to have different effects on children from 
different ethnic backgrounds, interaction variables (BLKIND, BLCKHS, and 
MEXHS) have been included. To determine the effect of Head Start on a 
Black child, the coefficients for Head Start and the Black-Head Start 
interaction term must be added. The coefficient of kindergarten for whites 
and Mexican-Americans (which was constrained to be the same because there 
were few Mexican-Americans) is positive in both the full-year and summer 
samples, but there is a great difference in the magnitude of the coeffi- 
cients. Judging from other results, we suspect that the true effect lies 
somewhere between the two values. Our data indicate that kindergarten his 
almost a zero effect for Black children; no good explanation for this 
finding has been d€:veloped. It is possible that the selection process 
for kindergarten works differently for Blacks than it does for whites. 
The coefficients for kindergarten are subject to the same potential bias 
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problem as Head Start » only more so. This is because no effort was made 
to match the kindergarten and nonkindergarten children, and discriminant 
analysis indicated that the kindergarten children come from significantly 
more advantaged backgrounds for all of our samp.les. 

Head Start has a negative, insignificant coefficient for white children 
for both the full-year and summer samples. The Head Start^ethnicity inter- 
action variables are positive in both samples, and the Black-Head Start 
interactio'^. is significant at the 5 percent level for the summer sample. 
The nen 'ft 'Jt of Head Start for Blacks is 1.6 ITPA points for the summer 
sample and .32 ITPA points for the full-year sample. It is surprising to 
find a smaller effect for full-year than summer programs, and we suspect 
that this difference may be spurious. Fov Mexican-American children, 
SJmm3:T Head Start has a positive effect (but smaller than for Black children), 
and full-year Head Start has a negative, insignificant effect. 

To .Interpret the practical significance of the regression coefficients 
it is necessary to understand what a change of one point on the cognitive 
test means for the child. In the ITPA manual Kirk et al. (1968) offer two 
Interpretations for the ITPA score. For children 5 to 7 years old, the 
manual suggests that an increase of one point on the ITPA score is approxi- 
mately equal to a gain of three months of psycholinguistic age or three 
months of mental age. For children 5 to 7 years old a gain of one point 
on the ITPA is therefore equal to a gain of four to five IQ points. Thus, 
the net effect of summer Head Start for Blacks can be expressed as either 
1.6 ITPA points, 4.8 months of mental age growth, or a 6.4 to 8.0 gain in 
IQ points. As IQ is the most widely used measure of mental ability, speci- 
fying the gains in an IQ metric allows for the easiest interpretations of 
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Head Itart effects. Regressions using the Metropolitan Readiness Test 
(MFl) as the dependent variable are not presented here because the 
coefficients have t' « ^crac sign as when the ITPA is used, and because 
the KRT o.rfers no way to easily interpret the value of the gains* 

In T-t' le 5 a i.ummary of the e*ffects of Head Start and kindergarten 
vften the t PA is ujed as the dependent variable is presented for all ten 
. ampiles* '.hn Wescinghovse researchers envisioned that the second and 
.hird grac^e f;amples cuald be used to test for the decay of the Head Start 
effect or the possible presence of a "sleeper" effect where ta»i gains 
lire not manifested until one or more years after the Head St^rt experience* 
H)wever, it i'3 cangerous to interpret the resuJ ts in this manner because 
the study was not longitudinal, but inste i v^s seriei of cross-section 
samples. First, the selection procedures *w d by the H^-^i Start centers 
may have changed over the three-year program; this might lead to noncom- 
parable s&mples* More Importantly, the Head Start progrtiias may have 
changed over the perlcxi — hopefully improved — and a larger effect for the 
first grade children may be indicative of changes in the programs rather 
than decay of the effects. 

For three of the four first grade samples analyzed, Heac? Start has 
a positive effect for Black children, and thes^ effects are equivalent 
to four to ten IQ points; only for the summer, mother only Siunple is the 
effect negative for Blacks (and then it is insignificant and ilmost zero) • 
Thus, the immediate impar :, of Head Start tcr Black children is quite 
favorable. For white children in the firjr, grade Head Start is effective 
only when the father is absei t uCOjn the homOr For the second and third 
grade samples the effect of Hea . Start is not great for Black children 
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except for the gra^le 2 summer sampla where only the mother Is present. 
For white children Head Start cx!..J)its the same pattern of effects as 
for the first grade — Head J^tart has a .i.irly high elfect (greater th&n 
one point on the ITPA) when only the nother is present, and very enmll, 
negative effect when both parents are present* If we discount heavi^/ 
the findings for t.^e Fecond and third grade samples on the grounds that 
Head Start programs wei e not yet fully developed, one may conclude that 
Head Stare is e'fectivt for Black children and for white children from 
mother-headed f . iej . These results are consistent with the overall 
findings of the We.sMnghouse study, but by adding additional control 
variables am oy straclf ing the data to permit the effects of Head Start 
to vary acro.,s ditfeTtHit types of children, we have discovered that Hedd 
Start may be approfriite only for certain groups. 

Kindergarten consistently shows a strong, positive effect for white 
childrer in vll samples, and there is no trend for Black children. How- 
ever, tfc cause he children who attended kindergarten are fron significantly 
more advantages backgrounds, we would not advocate an expansion of kinder- 
gc./i:en prog^a^r on the basis of the findings reported here. 

Although the bias is^iie can not be settled for the Westinghouse data, 
sev3ral technii ues have been rmployed to determine if the Head Start and 
control children differ significantly on socioeconomic and demographic 
character Is, ^.cs that are correlated with cognitive development. Discrimi- 
nant analysis was used to tost the joint hypothesis that Head Start and 
control children differ si{»nif icantly on the Independent variables included 
in the regressions; only for the first grade, summer, both parents present 
sample is there a significant difference favoring the control group. An 
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alternative procedure f^x iie^/ uring t'ese differences is discussed in 
the Appendix. Support of the null hypothesis th t the groups do not 
differ significantly on these variables adds credence to the belief that 
the regressioTis produce unbiased estimates of the effects of Head Start 
bat are irsufficient to guarantee unbiasedness . Besides the fact that 
certain selection procadures will produce unbiased estimates when the 
gio.*ps ^!irfer on their pretreatment cognitive development, we may not 
have the complete set of relevant variables available, e.g. 3 parental 
a'tlLudes toward education. Discriminant analysis was also employed to 
determine if there vers significant background differences between 
children who hav.. attended kindergarten and those who have not. For 
all ten :^amples it was discovered that children who attended kindergarten 
are from significantly more advantaged backgrounds as measured by such 
characteristics as income, family size, and parental educat?on. Thus, it 
is li'-vtily th^t the coefficients for kindergarten are inflated, 

Severn', modifications of the basic regression moilel were formulated 
to t J. sr .> additional hypotheses of interest. It has been observed by 
some researchers (e.g», Kerzog et al,, 1972) that preschool programs may 
prov!< - smaller benefits to the rost disadvantaged children. Such findings 
may r^^lect differential learning abilities by SES, but they may also be 
i.n irlxcation that the curricula of these programs are set at too high a 
le-^e} for the most disadvantaged children. To test the hypothesis that 
Head Start is equally effective for all children in the sample, a set of 
Head Start-SES interaction variables was added to the regression equations. 
The variables were formed by multiplying the Head Start durarav variable by 
each of the independent variables. The hypothejis that the entire set of 
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Head Start-SES interaction coefficients are equal to zero was then tested. 
The Head Start-ethnicity variables (BLCKHS and MEXHS) were not included 
in the test as they are part of the basic model. The null hypothesis was 
not rejected for any of the first grade samples. Because of thest. findings 
we have rejected the hypothesis that the effecc of Head Start is dependent 
upon the socioeconomic status of the child's family. 

A second extension of the model that was carried out for the first 
grade both parents present samples was to add neighborhood and Head Start 
center dummy variables tc the regression equation. The neighborhood vari- 
ables may add explanatory power to the model by controlling for differences 
in neighborhood environments. When the Head Start center variables are 
added to the model each center is considered as a separate treatment; we 
were thus able to test if the Head Start effects for the various centers 
are clustered about the overall effect or if there is a large variation. 
For these analyses only children from neighborhoods with at least four 
Hsad Start and four control children were included. The neighborhood 
variables were added first, and the set of coefficients is significant 
for both the summer and full-year samples when the ITPA mean is used 
as the dependent variable; the addition of these variables did not 
significantly change the coefficients of the previously included indepen- 
dent variables. The 5et of Head Start center variables was added to the 
regression equation with the neighborhood variables also included (so 
that these variables would not serve as a proxy for neighborhood effects) 
and then tested for significance. The set of coefficients for the 
center variables was nc significant at the 10 percent level. Thus, 
there does not appear to be significant variation in the effects of the 
centers analyzed. 
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4. Conclusions and Policy Implications 

Although the reanalysps presented above do not conpletely reverse 
the pessimistic findings of the Westinghouse study, they do indicate 
that Head Start may have a significant immediate impact on children from 
minority groups and white children from mother-headed families. The 
next logical step in this evaluation is to place a dollar value on the 
cognitive gains to determine if the benefits of Head Start exceed the 
costs. Although this decision must ultimately be made, it is beyond the 
author's ability to carry out such an imputation. To make such in., a- 
tions one would prefer to have longitudinal data available so that the 
permanence of the effect:, can be judged. Unfortunately, most of the 
longitudinal studies conducted so far have indicated that whatever cog- 
notive gains are accrued during a preschool program seldom last past the 
f:'rst year of formal schooling; this was the finding in the surveys of 
the literature carried out by Lois-Ellin Datta (1969) and Marian Stearns 
(1971) for the United States Departraent of Health, Education, and Welfare. 

It is clear that not enough is known at the present time to declare 
that Head Start is either a "success" or "failure." The first issue that 
must be addressed is whether or not an immediate gain in IQ of 5 to 10 
points is a sufficient output for a preschool program. If these benefits 
from Head Start are judged to be worth attaining, additional research 
must be carried out to determine why the effects dissipate so rapidly. 
The policy implications for sustaining Initial gains will depend upon 
the reasons for the subsequent loss. If the initial gains are simply an 
artifact due to the change of environment or changes in the children's 
attitudes toward taking tests, then efforts should be made to develop 
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programs which produce real gains or preschool programs should abandon 
the attempt to affect cognltrve development and concentrate on other 
aspects of development. Programs with earlier and later intervention 
should also be tried in case the ages of three to five are not as 
"critical" as was once thought. If it is found that the gains fade 
because they are not reinforced in the public school system, efforts 
should be made to better coordinate preschool and primary school programs. 
A final possible explanation for the fading of cognitive gains is that 
the home environment of disadvantaged children does not suitably reinforce 
the gains of preschool programs such as Head Start; if this is the case 
then efforts can be made to alter the home environment or to place the 
child outside that environment for a longer period of time. 

Any strong conclusion about Head Start or preschool educational 
programs in general would be premature at this time. We do not agree 
with the statement by Arthur Jensen (1969) that "Compensatory education 
has been tried and in apparently has failed." Our research has produced 
some evidence that Head Start may be most effective for those children 
that Jensen predicts can be helped least — Blacks, and white children 
from fatherless families. It must be remembered that preschool education 
for the disadvantaged is a relatively new field. Research should continue 
so that we may learn how to sustain and enhance the initial gains that 
are found. At the same time, we must realize the limitations of preschool 
education. As Jencks et al. (1972) have correctly claimed, education will 
not eliminate inequality in our society. It is possible, however, that 
Head Stait and other preschool educational programs can play an important 
part in the education of disadvantaged children. 
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NOTES 



Although all benefits and costs should be considered in the evalua- 
tion of a social action program, the major objectives should be given the 
most consideration. Cain and Hollister (1969, p. 6) claim that "in general 
the measures of program outputs, vhich may be proxies for the ultimate 
tangible changes, such as income change, employment gain, and educational 
attainment." With regard to Head Start, John W. Evans (1969, p. 254) 
states : 

. . . while Head Start has objectives other than cognitive and 
effective change, these other objectives are in large part instru- 
mental to the cognitive and affective objectives. That Is, the 
program is attempting to improve children medically and nutrition- 
ally in order to make it possible to chaage them cognitively and 
motivationally. 

2 

Cain and Watts (1970, p. 238) make the same point about determining 
the most efficient policy alternative. In their presentation, however, 
they suggest scaling the policy variables so that one unit of any policy 
variable has the same cost. If this procedure is used then the most 
efficient policy variable is the one with the largest regression coefficient. 

3 

Goldberger (1972a) has shown that the true regression of Y on X is 
not linear when selection is made on the basis of X*. Because empirical 
work is generally run using linear approximations, using a linear regres- 
sion reflects what will happen in actual experiments. In qualitative terms, 
the spurious treatment effect retains the same direction of bias, but since 
the within-group regressions are no longer parallel the treatment effect 
calculated in the nonlinear regression will be a function of X. 

4 

The major problem with the affective instruments is that they were 
designed specifically for the Westinghouse evaluation and were not veri- 
fied on large samples. The comments of Victor G. Cicirelli, the principal 
Westinghouse investigator, and several GEO officials cast doubt on the 
validity and robustness of the affective measures. Cicirelli, Evans, and 
Schiller (1970, p. 115) state that "our judgment about the affective find- 
ings should be tentative and this is the view the Westinghouse report took." 
In another article Evans (1970, p. 256) says "No great claims are made for 
the affective instruments." 

^The Hollingshead Index is formed by coding educational attainment and 
occupational status on seven-poxnt scales. The Index is formed by multiply- 
ing the occupational score by seven, multiplying the educational score by 
four, and taking the sum of the two products. Including the information 
in this manner constrains the coefficient for education to be four-sevenths 
of the coefficient for occupation. 
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APPENDIX 

Techniques for Measuring Pretreatment Differences 

There are several approaches that can be used to determine if the 
Head Start and control groups differ significantly 1i\ their pretreatment 
socioeconomic status. Perhaps the simplest is to perform an analysis 
of variance for each trait of interest. The problem with such a technique 
is that there is no good way to combine the results to make an overall 
judgment. Another possibility is to use discriminant analysis to test the 
joint hypothesis that the two groups have the same means on the variables 
of interest. The discriminant analysis can be carried out by regressing 
the dummy variable for Head Start on the same set of independent variables 
included in the educational production function. In the limiting case 
where there is no relationship between the independent variables and group 
membership all the coefficients would be zero and the constant would be 
0.5 ( assuming equal sample size in the two groups). Because we are dealing 
with a 0-1 dependent variable, the fitted values of the dependent variable 
can be interpreted as the probability that a particular observation will 
be in Head Start rather than the control group; this is not precisely 
correct and more sophisticated techniques have been developed for making 
such estimates. The regression coefficients can then be interpreted as 
the change in the probability of membership In the Head Start group for 
a unit change in the independent variables. The F-statistic for the test 
that the entire set of coefficients (excluding the constant) are equal to 
zero is equivalent to the test of the hypothesis that the means of each 
variable are the same in the two groups. 
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There are» however , several weaknesses of discriminant analysis. 

■^f some differences favor the control group and others favor the Head 

Start children, there is no way to take these offsetting differences 

into account. Even if all of the evidence indicates that one group comes 

from a better background, discriminant analysis does not provide any 

information abo^t what these differences will be in terms of cognitive 

development. To get a better idea of the differences in cognitive 

development that would have been obseirved if there had been no treatment, 

we have developed a two-stage procedure as an alternative to discriminant 

analysis. The first stage of the procedure is to regress the cognitive 

measure used as the posttest (such as the ITPA score) on all the indep- 

pendent variables except Head Start for the control group. This produces 

an estimate of the educational production function without Head Start. 

This fitted function is then used to form an imputed test score for 

obsei *ations in both groups; the imputed score is formed by the formula 
k 

YHAT = Z B.X. for each of the observations. In the seccnd stage of the 
i«l ^ 

procedure, uhe variable YHAT is regressed on the dummy variable for Head 

Start. The coefficient for the Head Start variable then relates the 

difference in the average imputed test scores for the two groups. For 

the first grade, summer, both parents sample when the ITPA score is 

used LS .wie dependent variable in the first stage, the second stage 

regression is: YHAT = 19.300 - .764HDSTRT. There were 623 observations 

(183.51) (-5.047) 

for vhe regression and the is .0394. Thus we predict that the Head 
Start children would have scored .764 ITPA points lower than the control 
children if they ha not had Haad Start, and this is roughly equal to 
3.8 IQ points. This procedure does not tell us whether or not the 
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previous estimates of the efifect of Head Start are biased, however, because 
we do not know which selection procedure was used to assign the children 
to Head Start. The procedure may be helpxal in determining how different 
the experimental and control groups are in ex post facto analyses in terms 
of the metric used in the post test. 
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